Inferring single-cell gene expression mechanisms using stochastic simulation
نویسندگان
چکیده
MOTIVATION Stochastic promoter switching between transcriptionally active (ON) and inactive (OFF) states is a major source of noise in gene expression. It is often implicitly assumed that transitions between promoter states are memoryless, i.e. promoters spend an exponentially distributed time interval in each of the two states. However, increasing evidence suggests that promoter ON/OFF times can be non-exponential, hinting at more complex transcriptional regulatory architectures. Given the essential role of gene expression in all cellular functions, efficient computational techniques for characterizing promoter architectures are critically needed. RESULTS We have developed a novel model reduction for promoters with arbitrary numbers of ON and OFF states, allowing us to approximate complex promoter switching behavior with Weibull-distributed ON/OFF times. Using this model reduction, we created bursty Monte Carlo expectation-maximization with modified cross-entropy method ('bursty MCEM(2)'), an efficient parameter estimation and model selection technique for inferring the number and configuration of promoter states from single-cell gene expression data. Application of bursty MCEM(2) to data from the endogenous mouse glutaminase promoter reveals nearly deterministic promoter OFF times, consistent with a multi-step activation mechanism consisting of 10 or more inactive states. Our novel approach to modeling promoter fluctuations together with bursty MCEM(2) provides powerful tools for characterizing transcriptional bursting across genes under different environmental conditions. AVAILABILITY AND IMPLEMENTATION R source code implementing bursty MCEM(2) is available upon request. CONTACT [email protected] SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
منابع مشابه
Gene expression Inferring single-cell gene expression mechanisms using stochastic simulation
Motivation: Stochastic promoter switching between transcriptionally active (ON) and inactive (OFF) states is a major source of noise in gene expression. It is often implicitly assumed that transitions between promoter states are memoryless, i.e. promoters spend an exponentially distributed time interval in each of the two states. However, increasing evidence suggests that promoter ON/OFF times ...
متن کاملSINCERITIES: inferring gene regulatory networks from time-stamped single cell transcriptional expression profiles
Motivation Single cell transcriptional profiling opens up a new avenue in studying the functional role of cell-to-cell variability in physiological processes. The analysis of single cell expression profiles creates new challenges due to the distributive nature of the data and the stochastic dynamics of gene transcription process. The reconstruction of gene regulatory networks (GRNs) using singl...
متن کاملSemi-quantitative Analysis of Expression of Various Genes in relation to Possible Markers for Theileria annulata Attenuation
The sporozoites of Theileria annulata invade bovine MHC II cells, where they differentiate into schizonts. The later can immortalize and induce fundamental changes in their host cells. Live attenuated vaccine is an important way of controlling T. annulata infection of cattle. Production is by prolonged cultivation of macroschizont-infected cells. The mechanisms underlying this transformation ...
متن کاملTherapeutic Efficacy Analysis of lncRNA NEAT1 Gene Knockout and Apoptosis Induction in Prostate Cancer Cell Line Using CRISPR/Cas9
Background and Objective: Long non-coding ribonucleic acid (lncRNA) has been identified as an important gene regulator and prognostic marker in various cancers. The present study aimed to investigate the effects of Nuclear Paraspeckle Assembly Transcript1 (NEAT1) gene knockout using Clustered Regularly Interspaced Short Palindromic Repeats-associated Protein 9 (CRISPR/Cas9) in PC-3 cell line. ...
متن کاملA Hierarchical Bayesian Model for Inference of Copy Number Variants and Their Association to Gene Expression By
A number of statistical models have been successfully developed for the analysis of high-throughput data from a single source, but few methods are available for integrating data from different sources. Here we focus on integrating gene expression levels with comparative genomic hybridization (CGH) array measurements collected on the same subjects. We specify a measurement error model that relat...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Bioinformatics
دوره 31 9 شماره
صفحات -
تاریخ انتشار 2015